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ESSENTIAL INSTRUMENT 
FOR Al RESEARCH 


The Challenges of Building a Platform for Al 


Data scientists depend on computing performance to gain insights and innovate 
faster, using the power of deep learning and analytics. GPU technology offers a faster 
path to Al, but building a platform goes well beyond deploying a server and GPU's. 


Al and deep learning can require a substantial commitment in software 


engineering. An investment that could delay your project by months as you SYSTEM SPECIFICATIONS 
integrate a complex stack of components and software including frameworks, ae BX Teste V 100) 6a Testa P100 
‘ 3 i pe z F Performance 1 petaFLOPS | 170 teraFLOPS 
libraries, and drivers. Once deployed, additional time and resources are continually (GPU FP16) 
needed as you wait for the ever-evolving open source software to stabilize. You'll GPU Memory 128 GB total system 
also be waiting to optimize your infrastructure for performance, along with CPU Dual 20-Core Intel Xeon 
ease : : E5-2698 v4 2.2 GHz 
administrative costs that increase as the system scales. 
NVIDIA CUDA? Cores 40,960 28,672 
. NVIDIA Tensor Cores 5,120 N/A 
The Fastest Path to Deep Learning (on V100 based 
: f o n systems) 
Inspired by the demands of Al and data science, NVIDIA? DGX-1™ fast-tracks your Naxiiauen ewer 3,200 W 
Al initiative with a solution that works right out of the box so that you can gain Requirements 
insights in hours instead of months. With DGX-1 you can simply plug in, power System Memory 512 GB OMN DDR4 
up, and get to work, thanks to the integrated NVIDIA deep learning software stack Sorge LX 1.92 TB SSD RAID 0 
and DGX-1 cloud management services. Now you can start deep learning training Network Dual 10 GbE, 4 IB EDR 
in as little as a day, instead of spending months integrating, configuring, and Software Ubuntu Linux Host 0S 
troubleshooting hardware and software. SEL Sontware Stack PODAIS 
System Weight 134 lbs 
Effortless Productivity System Dimensions 866 D x 444 W x 131 H (mm) 
Packing Dimensions 1,180 D x 730 W x 284 H (mm) 
NVIDIA DGX-1 removes the burden of continually optimizing your deep learning Operating 10-35 °C 


software and delivers a ready-to-use, optimized software stack that can save you Temperature Range 


hundreds of thousands of dollars. It includes access to today’s most popular deep 
learning frameworks, NVIDIA DIGITS™ deep learning training application, third-party 
accelerated solutions, the NVIDIA Deep Learning SDK (e.g. cuDNN, cuBLAS, NCCL), 
CUDA” toolkit, NVIDIA Docker and NVIDIA drivers. 


Revolutionary Al Performance 


While many solutions offer GPU-accelerated 
performance, only NVIDIA DGX-1 unlocks the full 
potential of the latest NVIDIA GPU's like the NVIDIA 
Tesla® V100, including innovations like next generation 
NVLink™ and new Tensor Core architecture. With its 
performance-engineered deep learning software stack, 
DGX-1 delivers up to three times faster training speed 
than other GPU-based systems. With the computing 
capacity of 25 racks of conventional servers in a single 
system that integrates the latest NVIDIA GPU technology 
with the world’s most advanced deep learning software 
stack, you can take advantage of revolutionary 
performance to gain insights faster than ever, powered 
by NVIDIA DGX-1. 


Investment Protection 


Your Al initiative is critical to your organization's 
success, and dependent on a frequently optimized 
software stack and integrated hardware infrastructure. 
With today’s rapidly evolving open source software and 
the complexity of libraries, drivers, and hardware, it’s 
good to know that NVIDIA's enterprise grade support 
and software engineering expertise are behind every 
DGX-1. This software stack is built on years of R&D, 
innovation, and deep learning expertise, and maintained 
by monthly optimized framework releases. Also, 
NVIDIA's support includes software upgrades and 
priority resolution of critical issues; you can have peace 
of mind that your environment is tuned for maximized 
performance and uptime. 


For more information, visit www.nvidia.com/dgx-1 


NVIDIA DGX-1 Software Stack 


DEEP LEARNING FRAMEWORKS 


2 
E CNT PYTÖRCH 


Caffe 


P TensorFlow mxnet theano "YP torch 


DEEP LEARNING USER SOFTWARE 
NVIDIA DIGITS™ 


CONTAINERIZATION TOOL 
NVIDIA Docker 


GPU DRIVER 
NVIDIA Driver 


SYSTEM 
Host OS 


NVIDIA DGX-1 Delivers 96X Faster Training 


DGX-1 with Tesla V100 7.4 hours, 96X faster 


8X GPU Server 18 hours, 40X faster 


CPU-only Server |] 711 hours 


ox 10X 20X 30X 40X 50X 60X 70X 80X 90X 100X 


Relative Performance (Based on Time to Train) 
Workload: ResNet50, 90 epochs to solution | CPU Server: Dual Xeon E5-2699 v4, 2.6GHz 
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